Any plans for a Qwen3-32B model?

by wanghf - opened 5 days ago

wanghf

5 days ago

It's the most cost-effective model to distill into right now.

4 days ago

Probably not, since Qwen3-32B has no released base model.

4 days ago

damn

Probably not, since Qwen3-32B has no released base model.

4 days ago

Probably not, since Qwen3-32B has no released base model.

You could still fine-tune the instruct model; it just may forget some of the original Qwen3 knowledge (overridden by the R1-0528 dataset).

4 days ago

fine-tune the instruct model; it just may forget some of the original Qwen3 knowledge (overridden by the R1-0528 dataset).

https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct is a good candidate, and Qwen3-30B-A3B would be awesome too.

4 days ago

The Llama 8B distills are so useful. I hope this one gets a distill too.
